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Abstract. A simpler neural-network approach is presented for the analysis of the top quark non-leptonic decay channel 
in events of the DO Collaboration. Results for the top quark signal are comparable to those found by the DO 
Collaboration by a more elaborate handUng of the event information used as input to the neural network. 
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1 Introduction 

The DO Collaboration has recently presented an analysis of its Fermilab Tevatron coUider data, aimed to estabhsh a tt 
signal with both top quarks decaying non-leptonically. The analysis makes use of neural networks (NN), whose input is 
mainly represented by thoroughly researched variables built from a complete and detailed jet reconstruction for the 
events [1,2]. 

This analysis, like previous theoretical NN studies of the same signal (see, e.g., [3]), is based on the presumption 
that NN's should be "helped" by insight we gather from the theoretical event representations developed to interpret the 
events. In other words, it is left aside the possibility that the NN acting directly on the raw data may find internal 
representations by itself, which are adequate for discrimination. 

The study presented here consists of a NN application with the latter approach: instead of using jets as 
intermediaries, the NN is directly applied to a calorimetric deposition grid. Simulated events for tt and the QCD 
background are provided by the event generator COJETS 6.24 [4]. The NN is a conventional feed-forward network with 
back-propagation training [5] handled by program NEURAL 2.0 [6]. 

By mimicking the event selection criteria used by DO, it is found that the NN results for tt non-leptonic with 
this direct and simpler approach are comparable to those obtained by DO employing jet-reconstruction. 

2 Event selection 

tt and QCD background events are provided by the event generator COJETS 6.24 [4]. For pp interactions at a cm. 
energy =1.8 TeV and a top quark mass = 175 GeV it yields a total tt cross-section o(tt ) = 7. 31 pb. 

A series of cuts is imposed in order to: i) suppress the contribution of leptonic and semi-leptonic decays of the top 
quark; ii) reduce the amount of the background. Where possible, the cuts mimic those used in the DO analysis. 

For the suppression of the top leptonic and semi-leptonic decay contributions, events containing an electron or 
muon with a transverse momentimi pj- > 20 GeV or a missing transverse energy E™™ > 20 GeV are excluded. 

For the sake of background reduction the following requirements are imposed: 

i) Total transverse energy E^' > 120 GeV; 

ii) At least one muon with transverse momentum Pj >4 GeV and absolute pseudorapidity I T7I < 2.5. 

iii) A linear aplanarity > 0.5. Considering only particles with | r/| < 2.5 (neutrinos excluded), after finding the 

— > — > 
thrust, or major axis M, in the transverse energy plane, and defining the minor axis m as the axis orthogonal to it, linear 

— > — > — > — > — > 
aplanarity is defined as = \ E i-m \ / Y^i] E i- M \, where E i is the transverse energy vector of particle / . 

For the integrated luminosity X = 1 10.2 pb-l applying to the DO event sample, the events left after all the cuts are: 

i) 128 events for // ; ii) 3868 events for the background. 

3 Variables 

The region I r/l < 2.5 is covered by a grid of 40 calorimetric cells with Arj x Af = 1 x 45°. Their energy depositions are 
thresholded at 10 GeV, i.e. cells with a deposition less than that are set to deposition zero. That is in order to suppress 
cimiulative effects from the many cells with low depositions, which may disorient the NN. The origin of the azimuth (p 
is set, for each event, by the side of the minor axis m opposite to that of Ei-m . 
Thus, the variables we use as input to the NN are: 

i) The 40 variables given by the energy depositions in the calorimetric cells; 

ii) The total (visible) transverse energy E^' ; 

iii) The maximimi muon transverse momentum observed in the event, max • 
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4 Neural network 

The neural network NN is structured in 2 cascading feed-forward nets, both with 1 hidden layer and 1 output node. The 
first, NNi, has in input only the 40 calorimetric deposition variables and employs 8 hidden nodes. The NNi output, 
Ej^ and Pj^^^ are the input variables of the second feed-forward net, NN2, which employs 2 hidden nodes. Its output, 
which in the following will be referred to as the NN output, varies between 0, the background target, and 1, the top 
signal target. 

The back-propagation training is applied first to NNi, with outputs from the trained net applied to the training 
events stored away. The same set of events (5,000 for each type) is then used for the back-propagation training of NN2. 

5 Results 

For a comparison with the NN results of the DO analysis, we pick up the dependence on the NN output of the ratio 
R = N (tt ^ non-leptonic) / A' (Background) , where numerator and denominator represent the numbers of events 
specified in parentheses. Fig. 1 shows R versus the NN output from the DO results and from this study. 

The results are comparable taking into account that, besides the intrinsic uncertainties of the Monte Carlo 
simulations involved in both calculations, in this study no special effort has been made in optimizing the event cuts, the 
calorimetric segmentation (pretty rough, although economical, as it stands now), the choice of the variables, the NN 
structure. That would have been outside the scope of this study, meant to theoretically examine the viabihty of this 
alternative approach and not aimed to a thorough quantitative interpretation of experimental data, hke in the case of the 
DO analysis. 

5 Conclusions 

Instead of employing an elaborate jet reconstruction, in a neural network analysis of the events it may be convenient to 
directly use the calorimetric map: the neural network itself will care to build its own internal representations of the data 
to discriminate. 

It should be noted that in the place of the deliberately down-to-earth handling of the calorimetric map used here, 
one can extract variables from it by using tools from the well supphed mathematical arsenal for the analysis of 2- 
dimensional data (like, e.g., the currently popular wavelet transform). 
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Fig 1. Dependence of the ratio R = N (tt —> non-leptonic) / N (Background) on the NN output, 
according to the results of the DO analysis and of this study. 



